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MINIMA IN BRANCHING RANDOM WALKS 

By Louigi Addario-Berry^ and Bruce Reed 
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Given a branching random walk, let Af„ be the minimum position 
of any member of the nth generation. We calculate EM„ to within 
0(1) and prove exponential tail bounds for P{|M„ — EMn | > x}, un- 
der quite general conditions on the branching random walk. In par- 
ticular, together with work by Bramson [Z. Wahrsch. Verw. Gebiete 
45 (1978) 89-108], our results fully characterize the possible behavior 
of EM„ when the branching random walk has bounded branching 
and step size. 

1. Introduction. The object of study in this paper is a supercritical 
branching random walk (or tree-indexed random walk) — which we view as 
a Galton- Watson tree T with root r and offspring distribution B, each of 
whose edges e has been augmented (or labeled) with an independent copy Xe 
of a random variable X (which we call the step size). For the formal details 
of a probabilistic construction of branching random walks, see, for example, 
Harris [23]. When X is nonnegative, this is often called an age-dependent 
branching process. 

We assign to each node v T the "displacement" (or label) which is 
the sum of the edge labels on the path from r to v. The depth of a node v €T 
is the number of edges on the path from r to -y. One of the most well-studied 
parameters associated with branching random walks is the minimum after 
n steps, that is, the minimum value of 5^, over all nodes v having depth n in 
T; we denote this quantity M^. When X is nonnegative, M„ may be viewed 
as the time at which the first element of the nth generation is born. For 
several special choices of branching random walk, M„ also turns out to be 
closely linked to the performance of data structures that arise in computer 
science; this connection is explored in depth by Devroye [18]. 
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The starting point for our research is the Biggins-Hammersley-Kingman 
theorem, which provides a law of large numbers for the minimum displace- 
ment M„ of a branching random walk after n steps. (We set M„ = oo if the 
process does not survive for n generations.) In a sequence of papers, each 
building on the results of the last, Hammersley [22], Kingman [25] and Big- 
gins [6] showed that under suitable conditions on the exponential moment 
of X, there is a finite constant 7 such that, conditional on the survival of 
the branching process, 

M„/n^7 almost surely. 

When Hammersley initiated this research into the first-order behavior of 
Mn , he posed several questions to which complete answers remain unknown. 
In particular, he asked when more detailed information about M„ — 
than that given by the above law of large numbers can be found, about the 
expectation of M„, and about whether the higher centralized moments of 
Mn (and particularly the variance) are bounded. Our goal in this paper is to 
provide answers to Hammersley's questions for a broad range of branching 
random walks. Before we state our results, however, we briefly discuss some 
previous work on this and related problems. 

In a remarkable paper, Bramson [7] derived extremely precise information 
about the maximum displacement of branching Brownian motion. In this 
model, an initial particle starts at position on the real line and begins a 
standard Brownian motion with variance 1 . The particle decays according to 
an exponential mean 1 clock; when the clock goes off, the particle splits into 
two, each of which continues an independent Brownian motion and each of 
which independently decays according to an exponential mean 1 clock. This 
process is continued forever. Bramson studied the maximum displacement 
M^^ of any particle after time t (of course, his results apply immediately 
to the minimum, by symmetry). He showed that the median m^"' of M^^ 
satisfies 

(1) mr = V2t-^+c + o(l), 

where c is a fixed constant, and additionally showed that M^"^ — m^^ con- 
verges in distribution. (In this paper, by log we always mean the natural 
logarithm.) It is worth noting that Lifshits [26] has apparently (see Hu and 
Shi [24]) provided an example of a branching random walk for which M„ 
minus its median is tight but does not converge in distribution, so in general 
a result as strong as Bramson's will not hold in the branching random walk 
setting. 

Since Bramson's work, it has been expected that for at least some branch- 
ing random walks, the median m„ of M„ should exhibit similar behavior. 
However, such results have been slow in coming. McDiarmid [27] proved 
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(among other results) that for a wide class of branching random walks, 
-^n — "yn = O(logn) almost surely. Bachmann [3, 4] studied the tightness 
of the family {Af„ — m„ : n E N}, where is the median of M„, showing 
that the aforementioned family is tight when X has logarithmically concave 
distribution function. (We have not stated all the conditions on X and B 
required for his result.) More recently, Bramson and Zeitouni [9, 10] also 
proved tightness of the family {M„ — :n£ N}, under a strong local con- 
dition on the lower tail of X. 

In the course of studying the height of random binary search trees (RB- 
STs) , Reed [28] proved that for a branching random walk with deterministic 
binary branching and exponential mean 1 step size, there exist explicit con- 
stants a > 0, (3 > such that 



the first such result for any branching random walk; Reed additionally 
proved exponential tail bounds for P{|M„ — m„| > x}. We remark that the 
difference in sign of the logarithmic terms in (1) and (2) is because the 
former equation bounds a maximum, whereas the latter equation bounds 
a minimum. Drmota [21] has also published an alternative proof of Reed's 
result, using completely different techniques. Chauvin and Drmota [13] have 
since extended Drmota's techniques to show exponential tail bounds for 
M„ around m„ when (among other conditions) the step size X satisfies 
what they call an intersection property. In particular, their result implies 
that such tail bounds hold when X has logarithmically concave distribution 
function. In a series of papers, Broutin and Devroye [11], Broutin, Devroye 
and McLeish [12] have recently extended the Hammersley-Kingman-Biggins 
theorem to a wide range of branching random walks in which dependency is 
allowed among the steps and in which the "cutoff" at which the minimum 
is measured is not deterministically n, but is determined by a second family 
of random variables coexisting on the edges of the Galton- Watson tree T. 

The results and techniques of Bramson [7] and Reed [28], in particu- 
lar, serve as guidance and inspiration for the work of this paper. For our 
approach, the asymptotic behavior of M„ and of its median m„ are best 
characterized in terms of the behavior of the logarithmic moment generating 
function (LMGF) A. For a real random variable X, we define the LMGF 
A = Ax of X by 



To better understand the utility of the function A in studying Mn, we first 
recall Chernoff 's bounding technique: if Sn = J2i'=i is a sum of n indepen- 
dent copies of X, then for any c G M and u > 0, by using Markov's inequality 
we have 



(2) 



rrin = an + (]logn + 0(1) 



A(t)=Ax(i) = logE{e*-^}. 



P{Sn < cn} = P{e 
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so 



(3) P{Sn < cn} < 



inf E{e"(^~"H 

u>0 



If there is u such that 'E{e'^^^~^^} < (EB)~^ (where B is the branching 
distribution) then it fohows that F{Sn < cn} < (Ei?)~", and it is fairly 
easy to show, using this fact and straightforward bounds on the growth 
of r, that lim mi oo Mn/n > c. (To show that in this case liuin^oo Mn/n 
actuahy exists and to determine the hmit takes more work.) 

The optimal choice for u in (3) is that for which A'(n) = c, as may be 
informally seen by differentiating 'E{e'^^^~^^} with respect to u. Choosing u 
in this fashion, and writing A'{u) in place of c, gives the bound 

(4) P{5„ < A'{u)7i} < [E{e"^} • e'^^'^")]" = e-"("^'(")-^(")). 

It turns out that the upper bound given by (4) is almost tight; this is the 
substance of the "exact asymptotics for large deviations" first proved by 
Bahadur and Rao [5], and is the reason that the behavior of A is key to our 
investigation. As our proof leans strongly upon the exact asymptotics for 
large deviations, we now take the time to formally introduce this result. 

Given X, Sn and A as above, let D\ be the set of values t for which A(t) 
is finite, and let L*^ be the interior of D\. We say X is a lattice random 
variable with period p > if there is a constant x such that (X — x) /p is 
almost surely integer- valued, and p is the largest real number for which this 
holds. We now state the asymptotic estimates for large deviations as they 
appear in Dembo and Zeitouni [17] (as Theorem 3.7.4): 

Theorem 1 (Bahadur and Rao [5]). Let S = {Sn}neN be a random walk 
with step size X , and define A and D°p^ as above. Choose any t G with 
t < and any function g{n) tending to plus infinity with n; then if X is 
nonlattice then for any a (z R with a < y/n/g{n), 

at-n{th' {t)-K{t)) 

(5) P{S„<A'W»-a} = (l + „(l)).-^=^j_, 

uniformly over a in the above range. Furthermore, if X is a lattice random 
variable with period p, then for any a which is a multiple of 1/p and for 
which a < yjnjgin), the same result holds. 

In fact, this theorem is stated with a constant in [17], but the proof in 
that book yields without modification the above formulation. Theorem 1 
contains two cases: the values of a for which (5) holds depend on whether 
X is lattice or nonlattice. Technically, this fact necessitates a case analysis 
in our proof, but as the two cases are virtually identical, we do not bother 
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with both of them. For the remainder of the paper, we assume that the step 
size X is nonlattice. 

We will shortly have use of the following corollary of Theorem 1. Let 
fit) = tA'it)-Ait). 

Corollary 2. For > t G D^, f(t) is infinitely dijjerentiable and strictly 
decreasing. Furthermore, if X is hounded from below then 

lim /(t) =log(l/P{X = essinfX}) 

t^—oo 

(where we interpret the right-hand side as oo if P{X = essini X} = 0). 

Proof. The facts that f{t) infinitely differentiable in and is strictly 
decreasing when t <0 follow from Dembo and Zeitouni [17], Lemma 2.2.5 
and Exercise 2.2.24. To prove the second assertion, first assume without loss 
of generality that essinf X = 0. Next note that A'(t) = B{X e^^ } /B{e*^ } > 
for all t^O. In particular, for all t < 0, by Theorem 1 we thus have 

p-nf{t) 

P{X =or < P{5„ < A'W«) = (1 +o(l))-^_=, 

so we must have f{t) < log(l/P{X = 0}). As t < was arbitrary it follows 
that lim4^_oo fit) < log(l/P{A: = 0}). 

To see that in fact equality holds, we first show that lim(_t_oo A'(t) = 
0. Note that for any fixed e > and t < 0, Ee*-^ > e*^/2p{X < e/3} and 
E{e*^/2l[x>,]} < e*"/2p{A: > e}, so 

^' Ee*^ - P{X<e/3} ' 

as f — > — oo. Next, given e as above, for all sufficiently large negative t and 
all X > e we have xe*^ < ee*^/^ , so 

E{Xe*^} = E{Xe*^l[x<e]} + E{Xe*^l[x>,]} 

<e(E{e*^l[x<e]} + E{e*^/2l[x>e]}). 
It follows that for such t, 

B{Xe'^} ^J^^E{e'^/H[x>e]} 



as t — > — oo, by (6). Since e > was arbitrary it follows that 

lim A'{t)= lim ^^^fP =0, 

as claimed. Now fix G N>o, and small 5 > 0, let e = 5/N and let t^ <tg <0 
be chosen so that A'{ts) = 6, A'{t^) = e [such choices of and t^ exist for 5 



6 



L. ADDARIO-BERRY AND B. REED 



small enough since A'(t) > for all t 7^ and lim(_>„oo = 0]. Then for 
any t < t^, f{t) < fits) - Furthermore, if we are to have Sn < en then at most 
n/N of the Xi can satisfy Xi > 6. Letting ps = P{X <5}, it follows that 

p-nf{te) ( / \ 

(1 + 0(1)) , = = V{Sn < en} < P Bm{n,ps) > n 1 - - 

SO 

hm fit) > fits) > -log(l/P{Bin(n,p5) > n(l - 1/A^)}). 

Since 6 > and N gN were arbitrary and lim^^oP^ = P{X = 0}, it then 
follows by standard binomial estimates that 

lim /(t) >log(l/P{X = 0}). □ 

t— 00 

Using the above exact asymptotics, it turns out that one can prove an 
analogue of Bramson's result for a wide range of branching random walks. 
The primary result of this paper is: 

Theorem 3. Consider a supercritical branching random walk with branch- 
ing distribution B and nonconstant step size X, and suppose that the fol- 
lowing conditions hold: 

(I) there exists an integer d>2 such that Y'{B < d} = 1; 

(II) there exists a real number u> such that Ee^^ < 00; and 

(III) there exists t £ D%, t < 0, such that tA'(t) - A(t) = log(ES). 

Let S be the event that the branching random walk survives. Then 

(7) E{M,|5} = A'(t)n-|logn + 0(l), 

and there exist constants C > 0, 5 > depending only on X , such that for 
all x G M, 

(8) P{|M„-E{M„|5}|>x|cS}<C7e-'^^ 

A brief discussion of the conditions appearing in Theorem 3 are in order. 
Conditions (II) and (III) control the positive and negative exponential mo- 
ments of X, respectively. Condition (III) is perhaps the least intuitive. For 
example, if X is bounded from below, then by Corollary 2, (III) simply re- 
quires that P{X = essinf X} < 1/Ei?. Together with a result of Bramson [8], 
this shows that condition (III) can not be removed. In particular, Bramson 
[8] has shown that if X is bounded from below and P{X = essinf X} = 
then EM„ = (essinfX) • n + O(loglogn) (Dekking and Host [16] contains 
further information about behavior of M„ in this case). The following the- 
orem highlights another situation in which EM„ does not have the form 
an + /?logn + 0(1): 
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Theorem 4. Given a supercritical branching random walk satisfying 
conditions (I) and (II), above, if X is bounded from below and additionally 
P{X = essinfX} > l/BB, then 

E{M„|cS} = (essinf X) • n + 0(1) 

and there exist constants C > 0, 6' > depending only on X such that for 
all X > 0, P{M„ > (essinf X) ■n + x\S]< C'e-^''^. 

Theorem 4 is not particularly difficult. It may be proved using the tech- 
niques developed by Hammersley, Kingman and Biggins in the course of 
proving the Hammersley-Kingman-Biggins theorem, and is also essentially 
contained within Dekking and Host [16]. We provide a proof that allows us 
to highlight, in a simplified setting, a technique we later use in proving The- 
orem 3. We remark that together with the aforementioned result of Bramson 
[8], Theorems 3 and 4 completely determine the possible behavior of Af„ in 
the case that X is bounded from below. 

If X is not bounded from below, then condition (III) essentially imposes 
that the left tail of X decays "sufficiently" (and in particular, at least ex- 
ponentially) quickly. In particular, if X is not bounded from below and all 
exponential moments {X — EX)_ are finite, then (III) will hold. 

Condition (II) is essentially necessary for our result to hold; in general, 
a relaxation of condition (II) may require a corresponding relaxation of the 
upper tail bound in (8). In particular, it may be the case that 

P{the root r has exactly one child|5} > 0, 

in which case it is not hard to see that the upper tail of M„ can decay no 
more quickly than the upper tail of X. 

Condition (I) is a requirement imposed by our particular use of the sec- 
ond moment method in the course of the proof. Independently of the cur- 
rent work, Hu and Shi [24] have shown that assuming bounded step size, 
assuming that (III) holds and that E{i?^+^} < oo for some e > 0, then 
{Mn — A'(n))/logn ^ (— 3/2t) in probability on S but that, surprisingly, 
{Mn — A'(n))/logn does not converge almost surely. It seems very likely, 
particularly in view of the former result, that condition (I) should not be 
necessary for the results of Theorem 3 to hold. However, we would expect 
different behavior in the case = oo. 

Remark 1. Condition (HI) allows us to apply Theorem 1 to obtain pre- 
cise estimates for tail probabilities. However, there are certain situations in 
which such estimates are available without recourse to Theorem 1, for exam- 
ple, by direct computation. The key properties we require are the following: 
there exist constants c G M and a, a' > such that 

P{S„ <cn-x] = G(e-"^' • ES~" • Ji'^/^), 
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uniformly over all X = o{^), and P{5„ <cn-x} = 0(6"'' ^ • EB^") for all 
X > 0. Whenever X satisfies these properties [and still assuming that (I) 
and (II) hold], we obtain an analogue of Theorem 3. More precisely, letting 
X* = x*{n) solve e~"^ • n~^^'^ = n, Theorem 3 holds (and with an identical 
proof) if we replace (7) by the conclusion that E{M„|5} = cn + x* + 0(1)- 
(The reason for this choice of x* will become clear over the course of the 
paper.) In particular, if X = — log(C/) where U is from the family of Beta 
distributions, such estimates are available by direct computation; this case 
is of particular interest due to its link with a variety of search trees arising in 
computer science (see, e.g., Chauvin and Drmota [13], Devroye [19], Drmota 
[21], Reed [28]). 

1.1. Our approach. We observe that it clearly suffices to prove Theo- 
rem 3 when EX = 0, and we shall hereafter assume this is the case. To 
begin to describe our approach, we observe that for [(Ei?)"J independent 
random walks with step size X, the expected minimum value EM™'^ of any 
node w at depth n in this "forest of random walks" is within 0(1) of the 
smallest m for which 

E{ I {nodes w at depth n such that 5"^^, < m}\} > 1. 

In other words, BM'^'^ is within 0(1) of inf{m:P{5^ < m} > 1/[(EB)"J}, 
a value we can easily derive to within 0(1) using Theorem 1. To get an idea 
why this is the case, choose t £ V^, t <0, and such that tA'{t) — A{t) = 
logEi?. For any real number a which is o{y/n), by Theorem 1 and a union 
bound we have 



P{Mi^'^ < A'{t)n - a} < (1 + o(l)) L(ES)"J 



e 



at~-n{tA'{t)~A(t)) 



x/A"it) ■ 27rn 



^gat-(logn)/2-0(l)^ 

SO P{M^'^ < A'(t)n — a} is exponentially small in ((logn)/2t — a) when 
a = o{y/n) and a > 0. Similarly, iiM}^'^ > A'(t)n-a then each of the L(ES)"J 
random walks S must have Sn > A'{t)n — a. By the independence of the 
random walks and by Theorem 1, we have 

P{AC<^ > A'(t)n - a} = (1 - P{5„ < A'(t) - a})L(ES)"J 

< exp{-(l + o(l))e"*-(^°s")/2-0{i)|^ 
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so P{M^'^ > A'{t)n — a} is doubly exponentially small in (a — (logn)/2t) 
when a = o{^/n) and a < 0. These inequalities do not quite yield a bound on 
EM™'^, but they do show that an extremely high proportion of the prob- 
ability mass of M^'^ lies near h'{t)n + (logn)/2t, and by combining these 
bounds with a Chernoff bound for the lower tail of M™*^, it is not hard to 
show that in fact 

E{Mn = A'(0n-i^ + O(l). 

We denote by m* = m* the quantity A'(t)n — logn/2t, and call m* the 
breakpoint. Whenever we write m* without subscript we always mean m^. 
By linearity of expectation, it follows easily from Theorem 1 that in the 
branching random walk, 

E{ I { w at depth n: ^nT,}\} ^ 1 when m — m* ^ 1 , 

E{|{tt; at depth n:Sw< m}|} ^ 1 when m* — m 1. 

Intuitively, then, the equation (7) can be understood by splitting the term 
(3 / 2t) log n into two pieces and writing 

(9) E{M„|5} = m*-i^ + 0(l); 

the term (log n) jt must then be explained by the dependence in the branch- 
ing random walk model. The bulk of the work of this paper is in under- 
standing and explaining why this dependence should contribute a term of 
just this form. 

At a high level, to explain this term, we shall apply the second moment 
method to bound the number of certain "special" nodes of T. We will in- 
troduce a notion of "goodness" of nodes; whether or not a node v is good 
will depend only on the shape of the random walk from r to v. We shall 
study the properties of the good subset G of the nodes of T at depth n, 
first showing that |G| is tightly concentrated around its mean, and then, 
with some additional work, showing that EM„ is in fact within 0(1) of 
infmE{|{i(; £ G : Sw < m}\} > 1. To begin to make this more concrete, we 
first explain the key property that "good" nodes will satisfy. 

1.2. Leading nodes. Given exchangeable random variables Xi, . . . ,Xn 
with associated partial sums Si, . . . we say that Sn is leading (or that 
S is leading after n steps) if 

(10) Si > B{Si\Sn} for alH = 1, . . . , n; 

equivalently, if Si > Sn ■ (i/n) for all i = 1, . . . ,n. (This terminology was 
introduced by McDiarmid [27].) Similarly, we say that Sn is strictly leading 
if the inequality in (10) is strict for alH = 1, . . . , n. Given a node v at depth 
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n in T, we say that f is a leading (resp. strictly leading) node if the random 
walk from the root to v is leading (resp. strictly leading). 

The intuition for why leading nodes are useful may be gleaned from exam- 
ples (A) and (C) of the preceding section. If a node v at depth n is leading, 
then knowing that <m does not increase the expected number of nodes 
w at depth n with <Tn hy too much. This fact allows us to use the sec- 
ond moment method (i.e., some variant of Chebyshev's inequality) to bound 
from below the probability that some such node exists when the expected 
number of such nodes is 0(1). (This line of argument via Chebyshev's in- 
equality is quite common in combinatorial settings; see, for example, Alon 
and Spencer [1], Chapter 4.) 

It turns out that EjM^} is within 0(1) of the smallest value m„, for which 
the expected number of leading vertices v at depth n with < rUn is at least 
1. Though it may not be immediately obvious, the assertion of the previous 
sentence is in fact equivalent to (7); we now explain this equivalence. 

To bound the probability that a vertex is leading, we use a classic combi- 
natorial technique introduced by Andersen [2] (and first used in the context 
of branching random walk by Devroye and Reed [20]), that we call a rota- 
tion argument, which involves considering cyclic permutations of the ran- 
dom variables Xi,...,X„; we will explain this approach in more detail in 
Section 3. 

By this method, we will straightforwardly be able to show that for all 
V E Nn and for values m < EM„ that are not too far from the breakpoint 

m , 

(11) Pj'S'^ < m and is a leading node} = 
(We shall make this more formal in Section 3.) It follows that for such m, 

B{\{veN^,S^<m,v is leading}!} = ef ^ili- ^ ^n, < m}\} 

\ n 

(12) 

= e(^^.P{5.<m}), 

so we are in fact asserting that EM„ is within 0(1) of the smallest value 
m for which, for nodes v at depth n in T, P{S't, < m} > (Ei?)"/n. It fol- 
lows immediately by Theorem 1 that m is within 0(1) of m* + (log n)/t, in 
accordance with (7) and (9). We may then view the term (logn)/t as the 
correction required in order to find a leading node. One of the key steps in 
proving Theorem 3 will be to understand the shape of random walks that 
end in leading nodes, and in particular how much time such walks spend 
near their (conditional) means. 
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1.3. A little notation and a basic fact. Let r„ be the subtree of T con- 
sisting of all nodes of depth at most n. We may view Tn as a subtree of a 
rooted, labeled d-ary tree with n levels and with root r, in the following 
manner. For a given node v of with children wi, . . . , Wd, let By be a copy 
of the offspring random variable B, and let X^^i, . . . be independent 

copies of X. Let a^, be a uniformly random permutation of {1, . . . ,d}; we 
assign label X^^i to edge vw^^^^i-^ for 1 <i <By and assign label oo to edge 
'^^CT„{i) for By < i < d. (This permutation gives the model a useful symme- 
try property that will be explained shortly.) We repeat this procedure for 
all nodes of T^, and label each node v with the sum Sy of all edge labels 
on the path from r to v (setting Sy = oo if any of these labels are infinite). 
With this labeling, the subtree of induced by nodes v with < oo is 
distributed precisely as Tn, and we hereafter view Tn = Tn{Tn) as a subtree 
of such a d-ary tree T^. 

For 1 < i < n and for any m G M, let iVj (resp. N'-) be the set of nodes of 
Tn (resp. T^) at depth i, and let Ni^m be those nodes v in Ni with Sy < m. 
To each v £T^ we assign a label Sy that is the sum of the edge labels X^ on 
the path from r to v — so it is possible that Sy = oo for some nodes v. We 
observe that for any integer m > 0, M„ is the smallest m < +oo for which 

For the sake of our analysis, it will be useful to fix a distinguished path 
P in with nodes r = t;o, f i, . . . , (it may be useful to think of this path 
as running "along the left-hand side" of T^). Each node Vi has one child 

fj+i = vf^^ in P — let its other children be called • • • ^vfj^-^^ . Denote by 

r*'J the subtree of T'^ rooted at vf . Let Tj^^ = T*'^' n Tn (which may be 
empty) , and let A^^'{„ be the set of nodes of Nn^m that are contained in T^^ . 

Given an automorphism aiT'n) of T^, we may view a as acting on the 
labels Xy^i and on the permutations Oy, by viewing the permutations Oy as 
fixed to the nodes of T^, the labels Xy^i as fixed to the edges of T^, and both 
as being moved by the automorphism a. The presence of the permutations 
Oy ensures that for any automorphism ol{T'^ of T^, the distribution of 
(with both permutations and labels attached) is identical to that of a(T^) 
(with both permutations and labels attached), and in particular, a induces a 
labeled isomorphism between Tn{T^) and r„(a(T^)). This symmetry, gained 
by the addition of the permutations, will greatly simplify later calculations. 
In particular, it yields the following fact. 

Fact 5. Let E,F and G be events in the ir-field generated by with 
its labeling such that E is invariant under automorphisms of and there is 
some automorphism a of T'^ such that G = a{F). Then P{£^|F} = P{£^|G}. 

For example, if F is the event that Sy < m for some m G M and some 
given V £ Nn, then G could be the event that Sy' < m for any given v' S Nn- 
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We will often use the phrase "by symmetry" in our arguments, rather than 
making explicit reference to Fact 5. 

1.4. Outline. In Section 2, we prove Theorem 4, and in doing so intro- 
duce the idea of amplification, which also plays a role in the proof of Theo- 
rem 3. In Sections 3 and 4.1, we flesh out the high-level discussion of Reed's 
approach given above and explain some of the key ideas behind our proof 
of Theorem 3, particularly the upper bound on Af„ and the importance of 
leading nodes. In Section 4.2 we prove two key lemmas which allow us to 
control the shape of random walks conditioned on their value after time n. 
Finally, in Sections 4.3 and 5, respectively, we prove lower tail bounds and 
upper tail bounds on M„ that together prove Theorem 3. 

2. Proof of Theorem 4. To give an idea of how we will prove Theorem 4, 
we first consider the special case that the offspring distribution B is de- 
terministically d and that X is deterministically bounded from above, say 
X < A for some constant A. In this case Pj/S} = 1 so the conditioning in 
Theorem 4 vanishes. We also presume for simplicity that essinf X = 0. We 
consider the related branching process Tq in which the set of children of 
a node is the set of its children in T for which the displacement is 0. As 
P{X = 0} > l/EB = 1/d, To survives with positive probability, so there is 
a positive probability pq that M„ = for every n. 

Suppose that we want to bound the probability that M„ is greater than 
X, for some given x. Since X < A almost surely, if n is at most x/A then 
every node at depth n has label at most nA < x, so P{Mn > a;} = 0. For 
larger n, we first observe that for any node v at depth [x/AJ , the tree rooted 
at V whose nodes are the descendants w oi v with S^, = Sy is distributed 
precisely as Tq; we temporarily denote this tree Tq{v). 

The tree To{v) survives with probability po, and if Tq{v) survives then 
in particular, there is a descendent w v at depth n in T{£) for which 
Sw = Sv, so Mn < Sw = Sv < x. It follows that 



Since the subtrees rooted at distinct nodes at depth [x/A\ are independent 




V at depth [a^/AJ 



n 





P{Tq{v) does not survive} 



V at depth [x/AJ 



= (l-Po) 



which in particular proves the tail bound of Theorem 4 and also immediately 
implies that EM„ = 0(1). The key to the above line of reasoning is the idea 
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of analyzing the subtrees of T rooted at depth [x/AJ independently in order 
to strengthen our probability bound. We will hereafter refer to this technique 
as an amplification argument. McDiarmid [27] uses this idea in much the 
same fashion as above in his analysis of the minima of branching random 
walks; it also plays a key role in both Devroye and Reed [20] and Reed [28]. 

When Y'{B = d} < 1 and X is not necessarily bounded, we can not argue 
as straightforwardly as above. However, we still have that P{Tq survives} = 
Po for some po > 0. For x > integers d> 0, we temporarily let Fx^d be 
the set of nodes at depth d with label at most x. Using an amplification 
argument just as we did in deriving (13) immediately yields that for any 
integers c and n with c > and n> d, 

(14) P{M„>x||F,|=c}<(l-po)'. 

So, to handle this case, we really need to control the distribution of the 
number of nodes at a given level of a supercritical branching process whose 
labels are not too large. To do so, we use the following result of McDiarmid 
([27], Lemma 1), who showed that for any supercritical branching process 
there exist constants 70 > 1, cq > 0, and 5o > such that for all integers 
i > 1, 

(15) P{0<|iV,|<7^}<coe"^"\ 

Fix 7o, Co and 60 as above, and define the function £{x) = [log^^ x] . Since S 
occurs precisely if \Ni\ > for all i, for n > i{x), 

P{M„ > x,cS} < P{0 < [iVL^WJ \<x} + P{Mn > x||iV^(^)| > x} 

(16) 

<coe"^o^+ sup P{M„ >x||7V^(^)| =A;}. 

x<k<d'(^) 

For fixed k in the above range, we have 
P{M„>x||A^f(^)|=A;} 

k 

= ^(P{M„>x||iV^(^)| = /c,|F^^^(^)| =i} 
(17) •P{|F,,,(,)|=i||iV,(,)|=fc}) 

k 

< 5^(1 -pom\Fx,e(.)\ = i||iV^(.)| = k} 

i=0 

< (1 - po)''/' + P{|i^.,f(x)l< ^/2||iV,(,)| = k}. 

Now fix u > such that Eie'^^ = a < 00; such u and a exist by condition (II). 
We then have 
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— pUX 

(18) 

for some ci > and 6i> and for all x > 0. It follows that 
E{|iV,(.) \F,,,(,)|||iV,(,)| = k}< cie-'^i" • k, 
so by Markov's inequality 

(19) P{|F,,,(,)| <fc/2||iV,(^)| = A;} <2cie-'5i^ 
Combining (16), (17) and (19), we thus have 

(20) P{M„>rE,cS}<C2e-''2^ 

for some C2 > and 62 > and all x > 0. Finally, since T is supercritical, we 
have P{5} =p>0. Letting C3 = C2/P, by (20) we thus have P{M„ > x\S} < 
cse~^^^ for n > i{x). For n < £{x), since given S there is at least one node 
in Nn, by symmetry and arguing as in (18) we have 

P{Mn > x\S} < P{x < < 00} 

This proves the exponential tail bounds of Theorem 4 and also shows that 
E{M„|5} = 0(1) = essinf X + 0(1). We remark that since we are assuming 
essinf X = 0, there is a shorter proof in the case that n < i{x) since Mg^^^ < x 
implies M„ < x. We have given the above argument since it does not use the 
fact that X is bounded from below, and we will appeal to it when proving 
Theorem 3. 

3. Typical leading nodes. We recall that a node f G r„ is strictly leading 
if the random walk ending at v stays strictly above its conditional expected 
value given S^,, that is, if it satisfies (10) with strict inequality. For m G M, 
we let 

(21) Gn,m = {v £ Nn : m — 1 < Sy < m, is strictly leading}. 

We impose the requirement that m — 1 < Sj] as it gives us more precise con- 
trol over the random walk; however, when m is near m* , Theorem 1 implies 
that Pj/S^, > 771 — l\Si, < m} is bounded away from zero. Furthermore, by 
replacing the constant m—1 by m — C for some large constant C, we could 
make this probability arbitrarily close to 1. Intuitively, therefore, we can 
think of Gn,m as the set of "typical" leading nodes in with Sy < m. The 
following lemma is the promised formalization of (11). 
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Lemma 6. Given a random walk S with steps distributed as X , and real 
numbers a, c with c > 'EX > a and for which P{EX < X < c} > 0, 

Pl-^n < o,n,Sn is strictly leading} < — ^ " ~ ^ 

n 

and 

P{'S'n-i < an — c} 



P{'S'n < an, Sn is strictly leading} > P{EX < X <c} 



n — 1 



Proof. The proof of Lemma 6 is an adaptation of proofs from Andersen 
[2] and Reed [28]. For fixed n, and n' with n<n' <2n, let 

n' 

Sn' — Sn + Sn'—n — ^ ^ ^iCmod n) ■ 
i=l 

We first note that if Sn is strictly leading, then for any j = 0, . . . ,n — 1, the 
random walk S^^^ with S'p^ = Sj+i — Sj for i = 1, . . . , n is not leading, since 
5^^'^ = Sn and 

Sn-j = Sn — Sj < Sn ^ ^ 

More strongly, an identical argument shows that at most one of the random 
walks S = S^^^ , S^^^ , ■ ■ ■ , 5"^"^ is strictly leading. Furthermore, as the random 
variables Xi,...,Xn are independent and the event {5„ < an} is fixed by 
permutations of Xi, . . . , Xn, it follows that for all j = 1, . . . , n, 

^{Sn < an, Sn is strictly leading} = P{5„ < an, S^^^ is strictly leading}, 

and so 

n • P{Sn < an, Sn is strictly leading} 

= P< [J < an, S^'^ is strictly leading} > 
lj=o J 

< P{5„<on}, 

proving the upper bound of the lemma. 

Next, for i = 1, . . . , n — 1, let Si = Si+i — Xi. In order that Sn < an and 
that Sn is strictly leading, it suffices that: 

• EX <Xi<c, 

• Sn-i <an — c and 

• Sn-i is leading. 
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As Xi is independent from Si, ... , Sn-i^ it follows that 
^{Sn < (in,Sn is strictly leading} 

(22) 

> P{EX <Xi<c}- P{S'„_i <an-c, Sn~i is leading}. 

For j = 0, . . . ,n — 2, define the random walk 5^-'^ by = Sj+i — Sj, for 
i = I, . . . ,n — 1 (where, letting n' = j + i, if n' > n — 1 then = Sn-i + 
Sn'-(n~i))- Again we have that S^^l^ = Sn-i for all j = 0, . . . , n — 2. Fur- 
thermore, if Sn~i is not leading then, letting j* be an index for which 
Sj* — Sn-i{j* / {n — 1)) is minimized, it follows immediately that for all 
i = 1, . . . , n — 1, 

qO*) C Q 

'->i — •JJ+J* ^1* 



(23) 



— Si+j* Sni ^ ) -\- Sni ^ ) Sj* 

\n—\J \n— 1 J 



n — 1 



that is, S^^"^ is leading at time n — 1. Therefore, at least one of the random 
walks = S, S^^\ is leading at time n - 1. Since X2, . . . , X„ 

are independent and the event {Sn-i < an — c} is fixed by permutations of 
X2 , . . . , Xn , we thus have 

P{Sn-i <an-c} = p| Sn^i <an-c,\J {s';^!^ is leading}| 

(24) . . 

< (n — l)P{S'„_i <an — c, Sn-i is leading}. 

Combining (22) and (24), it follows that 

PIS' _i < an — c\ 

P{5„ < an, Sn is strictly leading} > P{EX < Xi < c} • ^ " -, 

n — 1 

proving the lower bound of the lemma. □ 

By combining the above argument with the bounds from Theorem 1, the 
following lemma is immediate: 

Lemma 7. Given any function g{n) tending to infinity with n, for any 
V G Nn and any m for which \in* — m| < ^/n/ g[n), 

'P{5^, <m}~ 



n 
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uniformly over all m in the above range. 

We omit the proof of Lemma 7 as it is essentially identical to that of 
Lemma 6 (taking a = nn/n and choosing any fixed c > EX for which P{EX < 
X < c} > 0; such c exists since X is nonconstant). 

4. The upper bound. 

4.1. A warmup. To prove an upper bound on M„, we will eventually 
show that if E|G„,m| = ^(1) then P{|G„,m| > 0} = $7(1) [the definition of 
Gn,m appears in (21)]. In order to demonstrate one of the key techniques of 
the lower bound in a simplified setting, we prove: 

Lemma 8. Form > m* for which m* — m = o{y/n), if'E\Gn,m\ = ri(l/n^/^) 
then P{|G„,„| > 1} = 17(1/^5/2) = 17(l/m5/2). 

To prove Lemma 8, we use a version of the second moment method of- 
ten called the Chung-Erdos inequality (see Chung and Erdos [15] and also 
Devroye and Reed [20] and Alon and Spencer [1], Chapter 2), which in our 
setting can be stated as follows: for any integer i>l and any random set 
TZ C N'^ (recall that A'^^ is the set of nodes at depth n in T^), 

(25) P{|7^|>0}> 



l + sup,g^, E{|7^||^;E7^}■ 



We will apply (25) both here and later in the section. Recall that each Vi on 

the distinguished path vq, . . . , Vn-i has children f j+i = vf^i,vl^^, . . . , vl'^i^^ 
in T^. For i = 0, . . . ,n, j = 0, . . . ,d — 1, and m G M we hereafter denote 



(26) -^nm = £ Nn,m '■ V IS a desccndeut of vp^}, 
and define G^-'^ similarly. 

Proof of Lemma 8. By the symmetry of T^, E{|G„,m||^^ S Gn,m} is 
identical for all v G N^, so letting TZ = Gn,m in (25), we obtain 

(27) P{|G„,„|>1}> ^{\Gn,m\} 



1 + E{|G„^rn|| 

By symmetry, 

E{|G„,„|} = d"PKGG„,„} 

(28) 

= rP{Vn G Gn,m\Vn S Nn}P{Vn S N^}. 
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Given that f„ G iV„, Si,^ is just a sum of n i.i.d. random variables distributed 
as X, so by Lemma 7 and Theorem 1, 



P{Vn £ Gn,m\Vn G iV„} = 9 

(29) 

= e 



P{5'^,„ < m\vn £ iVn} 
n 

^—t{m—m,* ) 



where t < has been chosen such that tA'{t) — A{t) = log(ES). Now, for any 
edge e = vw of the tree , 

d 

P{Xe < 00} = ^ P{Xe <oo\B^ = i}P{B^ = i} 

(30) 

As the variables {-B„}i,gT;; are independent, it follows that for distinct edges 
e = vw and f = xy, and are independent unless v = x, that is, unless 
and y are siblings. In particular, it follows from this independence and 
from (30) that 

n{^.,_,.,<oo}|=(^— j . 

By combining (28), (29) and (31), we obtain the bound 

' —t{m—m* ) 

(32) E|G„,^| = e' 



n 

[A brief remark: we will usually omit the sorts of arguments leading to (31) 
when such calculations arise in later proofs; we have included them once for 
completeness.] We now turn to bounding E{|G„^m||'^n S Gn,m}- By symme- 
try, we have 

} < E{\Nn,m\\Vn G Gn,m} 
n-ld-1 

(33) = 1 + E E E{K+^'^||^;„ G Gn,m} 

i=0 j=l 

= 1 + (d - 1) ^ B{\Ni+^^%n G Gn,m}- 
i=0 

Since Vn G Gn,m, for i with < « < n we have 5"^,. > (m — l){i/n) > mi/n — 1. 
It follows that for such i, N^^^ is at most the number of descendants 
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V of v^^i for which Sy — Sy^ is at most m{n — i)/n + 1. We recall that 
m* = A'{t)i — logi/2t. Let Aj = mi/n + 1 — m* for i = 1, . . . ,n, and fix some 

distinguished descendant v of at depth n. By the previous observation, 
linearity of expectation and symmetry we have 

E{\Ni+y\\VnGGn,m} 

< |iV;+^'^|P{5^ - S,,^ < ml_i + \n-i)\Vn e Gn,m} 

(34) 



We now claim that for all i = 1, . . . ,n, 

(35) e"*^"- =0(ni/2e-*(™-'"*)). 
From (33)-(35), it follows that 

(36) n\Gn,n.\\Vn G Gn,m} < 1 + ©(n^/^e'^^"*) ) . 

Combining (32) and (36) we see that if E|Gn,m| = ^{l/n^^'^) then 

B{\Gn,m\\Vn G Gn,m} < 1 + 0{n'/^ ' E|G„,„|), 

which together with (28) proves the fact. To see that (35) holds, we write 

m{n - i) ^ 

^n^i = m„_j + 1 

n 

= ^^^(^-A'(t)(n-z)-i^^(^ + l 

mi logn logn — log(n — i) 

= m - + A'(i)z - ™ + i^^^^^M^ 
^ ^ n 2t 

1 

<m — m* H — {A'{t)n — m) + 1 
n 

^ * logn 

< m — m h 1, 

2t 

the final inequality holding as m > m* = A'{t)n + logn/ (2t) (recall that t is 
negative). Thus A„_j < m — m* — logn/2f, from which (35) follows imme- 
diately. □ 
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By Lemma 8 and a standard amplification argument, we could show 
tfiat P{\Nn^m+aiogn\ > 1} = ^(1) for some a = 0(1) and tfiereby show that 
E{M„|5} < m* + alogn; we will not do so as we are headed toward stronger 
upper bounds, the key step of which is to strengthen Lemma 8. We now state 
this aim formally. 

Lemma 9. For all values of m for which m* — m = o{^/n), 

(37) ^{\Gn,m\\Vn G = E|G„,„| + 0(1). 

Assume for the moment that Lemma 9 holds, and let 

/ * fog^^ A'm 31ogn 
m =m = K[t)n . 

By Lemma 7 and linearity of expectation, we see that m' is within 0(1) 
of the smallest value m for which E|G„,^m| > 1- Applying (25) with TZ = 
Gn,m': it follows immediately that P{|G„^m'| > 0} = 17(1). Since Gn,m ^ 
Nn^m for all m, it follows that P{\Nn,m' \ > 0} = 17(1), so P{M„ < m'} = 
11(1). By applying an amplification argument exactly as we did when proving 
Theorem 4, we immediately obtain exponential tail bounds for the upper tail 

of Mn. 

Corollary 10. There exist constants Ci > 0,5i > such that for all 
x>Q, 

P{Mn >m' + x\S} < Cie-'^i'^. 

To obtain the upper tail bounds of Theorem 3, it thus remains to prove 
Lemma 9. 

4.2. The shape of random walks. The proof of Lemma 9 is based on es- 
tablishing finer control over the behavior of a given random walk in T than 
that given by Lemma 7. In this section, we prove the two lemmas that accom- 
plish this. In Lemma 11, we bound the probability that a walk ^o, Si, . . . ,Sn 
is ever very far below its conditional mean given S'„; in Lemma 12, we bound 
the probability that such a random walk is ever close to its conditional mean, 
given that it is never very far below its conditional mean. These results are 
rather straightforward; the only technicalities are due to the fact that we 
must apply Theorem 1 in the course of the proofs. We now proceed to the 
details. 

For a < 0, we say that S stays above a up to time n, and write {Sn abo a}, 

if 

1 

Si> Sn - — ha for alH = 1, . . . , n. 

n 

In particular, {Sn abo 0} is simply the event that Sn is strictly leading. 
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Lemma 11. Given any function g{n) tending to plus infinity with n, any 
m for which |A'(t)n — m| = yjnj g{n), and any a < —1, 



lap 



n 



(38) P{5„ abo a\m - 1 < Sn <m} = O 
uniformly over all m and a in the above ranges. 

Proof. For simplicity we assume that a is an integer, that m = A'(t)n, 
and that g{n) < logn; this eases the notational burden without changing the 
essence of the proof. The probabihty in (38) increases as a decreases; it thus 
suffices to prove (38) when \a\ is at least some large fixed constant C. We 
assume \a\ is large enough that \a\ < ^|aP/log(|ap) < \/|a[/(7(|a|). 

We remark that if |a| > n^^^ then (38) holds trivially; we thus assume that 
\a\ < n}-!'^ . Let ni = n + 2|ap, and let m\ = A'(t)ni; as \a\ < n^^^, we have 
ni = 6(n). Let S' be the random walk with 5,- = Si^p+j — 5|a|3 (the original 
walk "started at time |ap"). The walk S' is distributed as 5; we will show 
that 

(39) P{S'^ abo a\m-l<S'^<m} = Or-^ 

which proves the lemma. We proceed by comparing the following two prob- 
abilities: 

P{?ni — 3 < Sni < rni, Sm abo 0} and P{?7t, — 1 < < m, 5^ abo a}. 
By Lemma 7, 

Pjmi — 3 < bn^ < mi,bni abo 0} = 9 1 I 

(40) 

- q( ^^^^ - 3 < S'ni < mi) 
V n 
By Theorem 1, 

/ -A'(t)ni s / -2|a|3A' (t) -A' (i)n\ 

P{m, - 3 < 5.. < .n,} = e (-^) = G ( ) 

(41) 

= e(e-2|«l'A'{t)p{m - 1 < 5; < m}), 

so, letting E* be the event {mi — 3 < < mi,5„^ abo 0}, (40) and (41) 
yield that 



(42) p{E*} = e 



g-2|a|3A'(t)p|^ - 1 < 5; < m} 



n 
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We will show that 
(43) r[E*] o|^ «"""''^'^*^PW-|yn<"^,^; cibo a} y 

Equation (39) follows immediately from (42) and (43); it thus remains to 
prove (43). We claim that for E* to occur it suffices that the following events 
occur: 

(1) A'(t)|ap + |a| + 1 < 5'|„|3 < A'{t)\a\^ + \a\ + 2, which we denote Ei; 

(2) A'(t)|a|3 - (|a| + 3) < - Sn^_ia\a < A'it)\af - (|a| + 2), which we 
denote E2; 

(3) m — 1 < S'^ <m, which we denote £"3; 

(4) S\a\3 abo 0; 

(5) letting 5]^^ = Sn^_\a\3+j - Sn^_\a\'i for j = 0, 1, . . . , \af, S^^^^ abo 0; and 
finally, 

(6) S'^ abo a. 

Events (l)-(6) are depicted in Figure 1. Informally, (1) and (4) ensure that 
the first |ap steps of the walk do not prevent E* from occurring, (2) and (5) 
do the same for the last |ap steps of the walk, and (3) and (6) do likewise 
for the middle ni — 2|ap = n steps of the walk. [This is not quite the whole 
story; the "extra height" (between \a\ + 1 and \a\ +2) gained in (1), as well 
as the extra height gained in (2), are required so that (3) and (6) can do 
their job.] By the independence of disjoint sections of the random walk, we 
thus have 

P{E*}>P{Ei,E2,E-i,S\a\3 abo 0,si^l abo 0,S'„ abo a} 




Fig. 1. The events (l)-(6). For example, (1) occurs because after |ap steps the walk 
is between the lowest two dashed horizontal lines, and (4) occurs because in the first \a\^ 
steps the walk stays above the dashed line segment connecting its endpoints. The event E* 
occurs because after n\ steps the walk is between mi — 3 and mi and has stayed above 
dashed line connecting its endpoints. 
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(44) 

= P{^i,5|„|3 abo 0} •P{S2,-Sf„^|3 abo 0}-P{Es,S'^ abo a}. 

As \a\ < y/\a\'-^ /g{\a\), we may apply Theorem 1 and Lemma 7 to bomid 
P{Ei,S\a\-s abo 0}: 

P{i?i,5|,|3 abo 0} = Qf^^ 



(45) = G 



Likewise, we have 



G 



(1) ...m_o^P™ 



a 

apA'(f) -t|a| 



|aP\/|QT 



a 



1 9/2 



\a\^A'{t) t\a\ 



(46) P{E„ Si:>, abo 0} = G j = Q 

Combining (44)-(46) yields that 

/ -2|a|3A'(t) ^ 

P{i?*} = 0(^ P{E-s,S'^, abo a}j, 

which is precisely the claim in (43). □ 

Using Lemma 11, we can bound the conditional probability that S spends 
much time near its mean, given that it is never much below its conditional 
mean up to time n. More precisely: for < /c < n, let b{n, k) = minjA;, n — k}, 
and let be the event that < Sn(k/n) + b{n,k)^/^''^ (that is "close" 
to its conditional mean; the meaning of "close" depends on how near k 
is to one of the ends of the random walk). Given an integer a < — 1, let 

I 1 57 

Ba = Ufc=|a|57 Ck (if > n/2 then Ba is the empty event). Then: 

Lemma 12. Given any function g{n) tending to plus infinity with n, for 
any m for which \K'{t)n — m\< ^Jnlg(n) and any integer a < —1, 



1 



(47) P{Sn abo a, Ba\m - 1 < 5'„, < m} = O 



Proof. As in the proof of Lemma 11, for the sake of readability we 
assume that m = A'{t)n and that g{k) < log A; for all k. We may presume 
that |a|^^ < n/2 or else the claim holds trivially. For any fixed K, for all 
a with \a\ < K, the claim follows immediately from Lemma 11, so we may 
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assume \a\ is larger than any fixed K; we henceforth assume \a\ is large 
enough that \a\ < ^/\a\^ /g{\a\). 

Choose k with b{n,k) > |a|^''. If {m — 1 < 5„ < m} and Ck are to both 
occur then necessarily 

(48) Sk< — + b{n, A:)i/57 = A'(t)A: + b{n, kf''>\ 

n 

Similarly, if {m — 1 < < m} and {S^ aho a} are to both occur then 
necessarily 

(49) 5fe > A'(i)A: + a - 1 

(recall that a is negative). More strongly, for C^, {m — 1 < S'n < m} and 
{Sn oho a} to all occur, there must be some integer % with a — 1 < i < 
6(n, fc)^/^^ for which the following four events occur: 

• A'(i)/c + i < S'fc < A'(t)/c + i + 1, which we denote Ai^k\ 

• {Sfc a6o a — (i + 1)}, which we denote -Dj.fc; 

and letting 5^ = S'fc+j — 5j for < j < n — /c, 

• A'{t){n -k) - {i + l)< S'^_^ < A'{t){n -k) -i, which we denote Ei^k', 

• {'S'^-fc fi^o a — (i + 1)}, which we denote Fi^k- 

Restating the above using the names of these events, we have 

[fe{n,fc)i/57j 

Ckn{Sn abo a}n{m-l<Sn<m} C [j Ai^knDi^knE.^knFi^k- 

i=a—l 

(50) 

„_U|57 

Since Ba = Ufc-|a|57 ^fc' follows by a union bound that 
P{Ba, Sn abo a, m — 1 < 5.„ < n} 

(51) 

n-|a|57 [b(n,fc)l/57j 
fc=|a|57 i=a-l 

We will show that for each k with 6(n, k) > |a|^^, for all i in the above range, 
(52) P{^,,,, = 0[P{m-l <Sn<m}- 

We remark that the bound in (52) does not depend on i. Presuming for a 
moment that (52) holds, by dividing through by P{m — 1 < 5'n < m} in (51) 
and (52), we obtain 

P{Ba,Sn abo a\m — 1 < Sn < n} 



MINIMA IN BRANCHING RANDOM WALKS 25 

n-\a\'-^[b(n,ky/'-^] J . 

~. ■ 1 P{m-l<5„<m} 

k=\a\''^ i=a~l — II — J 

Since b{n,k)^^^'^ > \a\, the inner sum in (53) has at most 2b{n,k)^^^'^ + 2 
identical terms. We thus have 



\ fc=|a|57 / 



which proves the lemma. It therefore remains to prove (52). To do so, we 
first write 

P{^i,fc, A,fc} = ^{A^,k} ■ P{Di,k\A,k}. 

Since \i\ < [b{n,k)\^/^'^ < k^/^'^ < \/k/g{k), and likewise \a\ < b{n,ky/^'^ , we 
have \a — {i + 1)| < 2^/k/g{k) + 1. By Lemma 11 (applied with a slightly 
different function g) it then follows that 

(54) p{A.iA.,}^o( 'i°i+i;i+"° )=o(M!^). 

so 

(55) P{^.„o.,,, = o(£(ii^M!i"- 

and an identical derivation yields 

(56) P{E,^k,Fi,k} = 0i ^ )• 
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By the independence of disjoint sections of the random walk, Ai k and Bi^k 
are independent of Ei^k and Fi^k, so P{Aj,fc, A,fc, = P{^i,fc, A,fc} • 
P{-Ej_fc, Fj^fc}, and by (55) and (56) we thus have 

(57) P{ A,,fc, A,fc, i^^.fc} = O ^ ^'^^ fc(^_fc) j ■ 

Finally, the events Aj^fc, -Ej^fc and {m — 1 < 5„ < m} all simply restrict the 
value of a certain random walk at its endpoint; by applying Theorem 1 to 
bound P{Ai±}, F{Ei^k}, and P{m — 1 < < "m}, it is easily seen that 

P{A..} • P{E.,k} = e (^P{m - 1 < 5„ < m} . 

Combining this last equation with (57) proves (52) and completes the proof. 
□ 

It is interesting to compare Lemmas 11 and 12 with the work of Bramson 
[8] in studying branching Brownian motion; he required quite similar bounds 
on the behavior of a Brownian bridge (or equivalently, of Brownian motion 
conditioned on its value after some time t). It would be interesting to find 
a proof of Theorem 3, or at least of Lemmas 11 and 12, that proceeded 
via comparison with Brownian motion. However, such an approach may be 
quite difficult, given the lack of success to date at transferring Bramson's 
argument to a discrete setting. At any rate, with these lemmas under our 
belt we are ready to prove Lemma 9. 

4.3. Proof of Lemma 9. The chain of reasoning is similar to that in the 
proof of Lemma 8. We first recall that for a node v at depth n, by Lemma 7 

(58) p{t> G G„,m4 = e ( I. 

Recalling the event Ba defined just before Lemma 12, we will write Ba{v) for 
the event that Ba occurs for the random walk ending at v. Since \A'{t)n — 
m'\ = O(logn), by Lemmas 11 and 12, for all integers a < — 1, 

P|m — 1 < bi, < m, by abo a, Ba[v)} = B --r: 

V n\a\^ ) 



o 



lap 



It follows that there is some large constant C > such that if u G Gn,m' then, 
letting Rq, Ri, . . . , Rn be the partial sums on the path to v, with probability 
at least 1/2 

(59) Rk>Rn-^ + h{n,kf'^'^ for all C < A: < n - C. 
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If u G Gn,m' and v additionally satisfies (59), we say that v is well-behaved. 
We write Wn^m' for the set of well-behaved nodes in Gn,m' ■ We emphasize 
that Wn,m' ^ Gn^m' ^ ^n,m' ■ Furthermore, since each node in Gn^m' is well- 
behaved with probability at least 1/2, it follows from the definition of m' 
that E|VFn,m'| = Q.{^\Gn,m'\) = 5^(1)- We claim that 

(60) ^{\Wn,m'\K e Wn,m'] = E|H^„,™'| + 0(1). 

Applying (25) with TZ = Wn,m', it immediately follows that P{|W„,m,'| ^ 
1} = r2(l), so since Wn^m' ^ ^n.m' we have PjlA'^^^m'l ^ 1} = ^(1) as claimed. 
We thus turn to proving (60). 

We now remind the reader of some notation from earlier in the section, 
and introduce a few new terms. We recall that node f j_i has child vf''^ = Vi 
that is on the distinguished path P, and that its remaining children are 
vf^ , . . . , v^^ . Node v\^'^ is the root of a subtree of r„ that we shall call 
T^'-'. We let W^^^ be subset of W^'-^ contained in T^^; this is consistent 
with the notation G]i^^ and N^i^^ introduced in and just after (26). For each 

0<i<n— Ifixan arbitrary node Xi at depth n that is a descendent of f^^^, 
and let the partial sums of the labels on the path from Vi to Xi be Sx^fi = 
0, Sxi,i-,Sxi,2-, ■ ■ ■ , Sxi,n-i- We remark that the edge labels contributing to the 
sum Sx^^n-i are a subset of the edge labels contributing to the vertex label 
; more precisely, 

(61) Sxi = Sx^^n—i + S^^. 

Finally, to simplify notation, for < i < n — 1 let Wi = W^]^,, and denote 
the functions P{-\vn G Wn,m'} and E{>„ G Wn,m'} by P'"{-} and E^^j-}, 
respectively. We now mimic the portion of the proof of Lemma 8 that leads 
to (33), in our case for the particular value m = m' . By symmetry, we have 

n d—1 

e-{|P7„,^,|} = i + EEe'"{<1J 

i=l 1=1 

(62) 

= 1 + J2id-1)B^{W,}. 

i=l 

Since d does not depend on n and, for a given i, Wh'-i < i'^, it follows that 
for any fixed integer < c = 0(1), YJl=n-c{d - l)'^'^ {Wi} < Y.Ui{d-iy = 
0(1). By this fact, by (62), and since {d— 1) is constant, to prove the lemma 
it therefore suffices to show that 

h' —C 

(63) E E-{r.|} = 0(l), 

1=1 

where C is the same constant as in (59). 
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By symmetry, E"'{|Wi|} = cP'^P'^lxi G Wi}. In order for Xj G Wi to oc- 
cur, we must in particular have that Sx^ < m' , so by (61), we must have 
Sxi,n-i < m' — S^^. The fact that Vn is weh-behaved aUows us to bound 
m' — Sy^ from above. Furthermore, Sxi,n-i is distributed as 5't,^_- and is in- 
dependent of S^. . This independence wih ahow us use Theorem 1 to bound 
the conditional probability that Sxi^n-i <m' — 5'„. . When i is far from 1 
and from n — C, the bounds on Si given by the fact that Vn is well-behaved 
will ensure that S^. is large enough that the conditional probability that 
Sxi,n-i <m' — Sy^ is extremely small. By slightly modifying this same ap- 
proach, we will prove similar bounds when i is near 1 or near n — C; summing 
these bounds will prove (63). We now turn to the details. 

Fix some function g with g{k) < log/c for all k and with g{k) tending to 
plus infinity as k tends to infinity. For any i, for all c with |c| < yjn — i/g{i)^ 
by Theorem 1 and by the independence of from Sxi,n-i-, we 

have 

P'"{Sx,,n~i<h'{t){n-i)+c] 

(64) = V{Sx,,n-i < A'(t)(n + c\Sx,,n-i < oo} • P{Sx,,n-i < oo} 
Furthermore, by (61), 

(65) P^{Sx^ < m'} = P'"{Sx,,n-i <m'- 5J. 
Let r = r(n) = \{2\ogn/\t\)^'^]. 

Case 1 {r <i<n — r). For any k>C, since Vn G Wn^m'-, we have Sy^ > 
m' ■ {k/n) -1 + b{n, fc)^/^'', so 

(66) m' - Sy^ < m' ■ + 1 " K^, if'^'^ 

(67) <A'(t)(n-i) + ^^-6(n,i)V57. 

[We will also use (66) in the case that i <n — r.] When b{n, z)^/^^ > 21ogn/|t|, 
certainly logn/2|t| < \Jn — i/g{n — i), so by (64), (65) and (67) we have 

P"{5., < m'} < P- < A'{t){n - i) - + l| = 6 , 

(68) 

so by linearity of expectation and symmetry, E^'lWjl = 0(l/n) for such i. 
Letting r = r{n) = [(21ogn/|t|)^'^] , then, we have 

j:B-m\=o(^)=o{i). 
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This bounds the bulk of the sum (63); it remains to consider the cases when 
i is either close to 1 or close to n — C. 

Case 2 {n — r<i<n — C). Let k = n — i, so C < k < r. Since k = 
0((logn)5^), m' ■{k/n)=A'{t)k + 0{l), so from (65), (66) and the fact that 
b{n, i) = k we have 

P'"{xi e Wn,m'} < P"{5., < m'} = P{S^,,n-i < m' - 5, J 

= 0{P{S,^,n-i < A'{t)k - b{n, i) V5^}) 

= 0{P{S^^,n-i<A'{t)k-k^/''}), 

which together with (64) gives 

By linearity of expectation and by symmetry, it follows that E"'|PVj| = 
0{e^'' ' )=0{k-^) (recall that t is negative). Therefore 

n-C / r . \ 

i=n—r \k=C / 

Case 3 {1 < i < r). Let Aj = Sy^ — m' ■ (i/n). Since Vn £ Gn,m', neces- 
sarily Aj > — i/n > —1. In order that Xi S Gn^m'y it is necessary that: 

• m' ■ ((n — i)/n) — (Aj + 1) < Sx,,n-i < f^' ■ {{n — i)/f^) — (call this event 
E'i), and that 

• Sxi,n-i abo - (Ai + 1), 

so 

P'^ixi £ Wn,m'} < P'"{X^ G Gn,m'} < P'" {E„ S,^,n-i abo - (A, + 1)}. 

We shall show that 

(69) P^{Ei, S,^,n-^ abo - (A, + 1)} = O [-^^^zr^^) , 

from which it follows just as above that J2l=i 'Ei'^\Wi\ = 0(1). It thus remains 
to prove (69). We first observe that by the same argument used to prove (68), 
we have 

P^S^E,,S.,,n-^ abo -(Ai + l)|Ai>^^| 

2 log n ~ 



<P'"iE, 
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(70) 



logn 



<F<S^^,n-i<A'{t){n-i) 



2\t 




Furthermore, if Aj < 21ogn/|t|, then A^ < \/n — i/g{n — i), so by applying 
first Lemma 11, then Theorem 1, we obtain 



Combining (70) and (72) proves (69) and completes the proof of Lemma 9. 

5. The lower bound. We know that the naive approach to proving a 
lower bound on P{M„ < m}, namely, bounding F{vn G -/V„,m}i then apply- 
ing a union bound, will not work. It is the approach we used in Section 1.1 
when discussing many independent random walks, and only begins to yield 
tail bounds when m < m* + 0(1). 

On the other hand, we observe that we can easily prove strong enough 
bounds for one group of potential nodes of Nn^m when m < m' + 0{1), 
namely, the set Gn,m [we recall that m' = A'{t)n — (3 logn) /2t]. By Lemma 11 
(or, indeed, by Lemma 7), when m is not too far from m', P{vh G Gn,m} = 
0(P{vh S Nn,m}/n)^ from which strong bounds on Y*{Gn,m 7^ 0} follow 
directly from Theorem 1 and a union bound. Intuitively, this bound on Gn,m 
contains almost the whole "reason for" the lower bound, in the following 
sense. Given a node v at depth n and m < m', if S^, < m is to occur, then 
either is "close to being" a leading node, in the sense that there is a 
"small" constant a for which stays above —a, or is "far from being" a 
leading node, in which case there is a "large" constant c for which does 
not stay above — c (or something in between these two scenarios occurs). 
The former scenario is unlikely due to the bounds in Lemma 11, and the 
latter scenario is unlikely due to large deviations estimates (and everything 
in between is unlikely due to a mix of these two reasons). 

We prove our general bound by splitting Nn^m into many groups which 
differentiate among the possibilities outlined in the above sketch. We then 




(71) 
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bound the probability each group is nonempty using large deviations esti- 
mates and the Lemmas from Section 4.2. When we recombine the bounds we 
obtain for the individual groups (by union bound), the result will be strong 
enough to yield bounds on 'P{Nn,m 7^ 0} that are exponentially small in 
m' — m when m is not too far from m' . For convenience, we restate the 
lower bound we are aiming to prove. 

Lemma 13. There are C2 > 0, 82 > such that for all x > we have 
P{M„ <m' -x}< Cae--^^^. 

Theorem 3 follows easily from the above lemma and Corollary 10; its 
proof appears in Section 6, below. In proving Lemma 13, it will be useful 
to use Chernoff [14] bounds as well as Theorem 1, as Chernoff bounds are 
not limited to windows of size o{^/n) around a linear slope. In particular, 
for our choice of t we have: 

Lemma 14 (Chernoff bound). For all r > and all integers k>l, 
P{5,,<A'(t)fc-r}< J. 

Lemma 13 is a rather straightforward consequence of the following, weaker 
lemma, plus Lemma 14: 

Lemma 15. There are C3 > 0, > such that for all m < m' with 
m' — m< (21ogn)/|t|, 

(72) V{^v G 7V„ : m - 1 < 5.,, < m} < Cge-^^^"^'""^) . 

Proof of Lemma 13 assuming Lemma 15. Given m with < m' — 
m < \/n/g{n), if m' — m> 2logn/\t\ then, letting A = A'{t)n — m, by the 
definition of m' we have A > (m' — m)/A. It follows by a union bound and 
by Theorem 1 that 

P{M„ < m} < d"P{5^,„ < A'{t)n - A} = 0(e*^) = 0(e*('"'-'")/^), 

which proves Lemma 13 in the case that m' — m> (21ogn)/|t| (since t is 
negative). If < m' — m < (21ogn)/|t|, then we have the following inclusion: 

{M„<m}c|M„<m'-^^| 

(73) 

L{21ogn)/|t|J 

U U {3v £ Nn-.m' - i - 1 < <m' - i}. 

i=[m'— mj 
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Since m' — (21ogn)/|t| = A'{t)n — (logn)/2|t|, by Lemma 14 and a union 
bound, 

(74) p|m„ < m' - j < ^ = e-^°s"/2 < ^^^-^74, 

I 1^1 J 

By (73), (74), Lemma 15 and a union bound, we thus have 
P{Mn <m}< e*('"'-'")/4 

L{21ogn)/|tIJ 

+ ^ P{3v e Nn-.m' - i - 1 < S^, < m' - i} 

i= [m'—m] 

L(21ogn)/|t|J 
< gt(m'-m)/4_^^^ ^ g-53i 

i=[m'— mj 

which proves Lemma 13 in the case that 0<m' — m < (21og?T,)/|t|. □ 

We now turn our attention to proving Lemma 15. We first note that we 
need only prove (72) for m' — m larger than any fixed constant, as we may 
presume the bound holds for m' — m = 0{1) by our choice of C3. For the 
remainder of the section, we write A = A'{t)n — m and A' = m' — m> L for 
some large constant L. We have A' = A + (31ogn)/2|t| , so A' < (21ogn)/|t| 
and A < (logn)/2|i|. 

We now proceed to define the "homogeneous" groups discussed above, 
and bound the probabilities they are nonempty, in a sequence of claims. The 
proof of each claim will consist of straightforward applications of Theorem 1 
and Lemmas 11 and 12, and Lemma 15 will be a trivial consequence of the 
bounds of the claims. 

Let a = [el*l^'/^'^J , and let Aa be the set of nodes v E Nn for which m — 1 < 
Sy < m and for which {Sy abo — a} occurs. 

Claim 16. P{^a / 0} = 0(e'^*(™'"™)/^°). 

Proof. By Lemma 11, 

/a^\ /g9t(m'-m)/10 

P{vn£Aa\m -l<Sy<m} = 0{ — )= 



n J \ n 



By Theorem 1, it follows that 

'g9i(m'-m)/10 
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The claim follows by a union bound over all v G Nn- □ 

Next, let c = [log^^ n\ , and let He consist of the nodes v G A'^^ with m — 1 < 
< m and for which {Sy abo — c} does not occur. 

Claim 17. P{iJc / 0} = 0(e-(""'-'")). 

Proof. If He ^ then for some < k <n there is a node y G for 
which Sy < mk jn — c. We have 

mk {A'{t)n-A)k Ak 

c = c = A (t)k c. 

n n n 

Therefore, by Lemma 14, 

r mk ^ /pt{c+Ak/n) 

Since c = [log^*^ nj , we may assume n is large enough that c — Ak /n> log^ n, 
so (75) yields 

(76) Pk,<^-c|=0^^ 



By a union bound, we obtain 

n-l 



k=i ^ n J 



\k=l 



□ 



For each integer h with a <b <c, we let M;, be the set of vertices v £ Nn 
for which m — 1 < Sy < m and for which {Sy abo — (6 + 1)} occurs but 
{S^ a6o — (6+ 1)} does not occur. We next define a subset of which we 
call M™''^ by saying that Vn £ M™^'^ if Vn G M;, and additionally, there is k 
with 6^'' < k <n — b^"^ for which S^^, < mk/n — b. We extend this definition 
to all nodes v S A'n by symmetry. 

Similarly, for values k with k < 6^^ or with n — k < 6^^, we define a set 
by saying that v„ G if Vn G M;, and additionally, k is the smallest 
value for which S'l.j. < mk/n — b; again, we extend this definition to all nodes 
V S Nn by symmetry. The sets M™'^ and {M^ : min(A;, n — k)< 6^'''} partition 
Mfe. We now bound the probabilities that these sets are nonempty. 

Claim 18. P{ULa{^r''^ / 0}} = 0(e*("*'~™)/"). 
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Proof. Fix an integer b with a<b <c. By Lemma 12, 

1 



P{vn G Mfo^'^lm -l<S^,^<m} = 
so by Theorem 1, 



n 



69 



P{vn G Mr''} = O 



P{m — 1 < 5"^,^ < m} 



J:(m' —m) - 



(gt(m'— m) 

By a union bound, it follows that 

^t{m' —m) - 



/ t(m-m)\ 



so by summing over b with a<b <c, since a = el*'^™' we obtain 

n' —m)\ / „t(rn' —rn) \ 



{c ~1 / ^ pt(m'-m)\ /„t(m'-m) , 

U / 0} = o E ) = o ( ) = o(e*(-'-)/^' 



Claim 19. P{{}l=a{}f=i^{M!^ + 0}} = ©(e-^'"'-™)). 

Proof. Fix 6 and k as above. For each node x at depth A;, let Wx be 
the set of descendents of x in M^. By a union bound over N^^, P{M^ 7^ 
0} < d'^P{W^,j. / 0}. If is nonempty, then necessarily mk/n — (6+ 1) < 
— Tmk/n — b. Since k < 6^^ < c^^ < (logn)-'^'^^, 

— - 6 = A'(t)fc + ^ - & = A'(t)A: - 6 + o(l). 
n n 

It follows by Lemma 14 that 

nw,, / 0} = e(p{5,, < A'(t)fc - b}) 

By (77) and a union bound, we have that P{M^' / 0} = 0(e*''), so by a 
second union bound 

b=a k=l J \f)=a / '-' 
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Claim 20. P{ULaUfc=rHK"'' + ^l) = 0[e-^^'~^)). 

Proof. Fix h and k as above. For each node x at depth n — k, let Wx 
be the set of descendants of x in M^"-^ By a union bound, 

(78) P{M^-^ / 0} < d"-'=P{tn_, / 0}. 

Suppose is nonempty — then necessarily 

and in addition {S^^^^ abo — b} must occur (or else n — A; is not the first 
time the random walk ending at Vn falls b below its conditional mean). Now, 
by Theorem 1, since A > (31ogn)/2t, 

P{S„„_. < - <,} = P{S,,_. < A'«)(„ - k) - - , 

/ t{A{n-k)/n+b)\ / pt(A+b) 



dn-k^n - k J \d^-^^/n 
tb 

O 



Furthermore, by Lemma 11, 
(80) 



P<i Sv^^k (^bo -b 



ne 



n n 



ot^]=o('- 



n—k J \ 



n 



Combining (79) and (80) yields that 



P{VF^, ^/0} = O — ^ =0, , , 

L "n~k /J y d^^~^ I \ rin — k 

which combined with (78) implies that P{M^~'' / 0} = 0(6^ '"§''+*''). Just 
as in the proof of Claim 19, summing this bound over b and k yields the 
result. □ 



We are now prepared for: 



Proof of Lemma 15. It is immediate from the definitions of the sets 
Aa, He, M™"^ and that any vertex v G Nn with m — 1 < < m is 
either in Aa, or in He, or in M™*^ (for some integer a <b < c), or in 
(for some integer a <b < c and some integer k for which either 1 < k < b^^ 
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ov h — b^"^ < k < h). Applying Claims 16-20, respectively, to bound each of 
these events, it follows that 

P{3v eNn:m-l<Sy<m} = C)(e9*K-'")/iO) + 0(e-(™'-™)) 

+ 0(e*K-"^)/ii) + 0(e-(™'~™)) 
+ 0(e-(™'-™)) 

so the bound of Lemma 15 holds as long as we choose 5i so that < 5i < 
min(|t|/ll, 1) and choose Ci large enough. □ 

6. Proof of Theorem 3. By Corollary 10, there exist Ci > 0, 5i > such 
that for all a; > 0, 

P{M„ > m' + x\S} < Cie-^i^. 

By Lemma 13 and Bayes' formula, there exist C2 > 0, 62 > such that for 
ah X > 0, 

Taking C4 = max{Ci, C2/P{5}} and 6 = min{5i, (52}, we obtain that for all 
a: > 0, 

P{\Mn - m'\ > x\S} < Qe"^^. 
It follows immediately that 

00 

|E{M„|cS} -m'l <E{|M„ -m'llcS} <^P{|M„ -m'l >i\S} 

i=0 



1-e-^' 

proving (7); and letting A = 6*4/(1 — e"''), for all x > we have 

P{|M„ - E{M„,|5}| > x\S} < P{|M„ - m'l > X - A\S} < Qe'^^e"^^, 
so (8) holds with this choice of 6 and with C = C^e^^ . 
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